Product Code Database
Example Keywords: pajamas -the $91
barcode-scavenger
   » » Wiki: Sigmoid Function
Tag Wiki 'Sigmoid Function'.
Tag

A sigmoid function is any mathematical function whose graph has a characteristic S-shaped or sigmoid curve.

A common example of a sigmoid function is the logistic function, which is defined by the formula

\sigma(x) = \frac{1}{1 + e^{-x}} = \frac{e^x}{1 + e^x} = 1 - \sigma(-x).

Other sigmoid functions are given in the Examples section. In some fields, most notably in the context of artificial neural networks, the term "sigmoid function" is used as a synonym for "logistic function".

Special cases of the sigmoid function include the (used in modeling systems that saturate at large values of x) and the (used in the of some ). Sigmoid functions have domain of all , with return (response) value commonly monotonically increasing but could be decreasing. Sigmoid functions most often show a return value ( y axis) in the range 0 to 1. Another commonly used range is from −1 to 1.

A wide variety of sigmoid functions including the logistic and hyperbolic tangent functions have been used as the activation function of artificial neurons. Sigmoid curves are also common in statistics as cumulative distribution functions (which go from 0 to 1), such as the integrals of the , the , and Student's t probability density functions. The logistic sigmoid function is invertible, and its inverse is the function.


Definition
A sigmoid function is a , differentiable, real function that is defined for all real input values and has a positive derivative at each point.


Properties
In general, a sigmoid function is monotonic, and has a first which is bell shaped. Conversely, the of any continuous, non-negative, bell-shaped function (with one local maximum and no local minimum, unless degenerate) will be sigmoidal. Thus the cumulative distribution functions for many common probability distributions are sigmoidal. One such example is the , which is related to the cumulative distribution function of a normal distribution; another is the function, which is related to the cumulative distribution function of a Cauchy distribution.

A sigmoid function is constrained by a pair of horizontal asymptotes as x \rightarrow \pm \infty.

A sigmoid function is for values less than a particular point, and it is for values greater than that point: in many of the examples here, that point is 0.


Examples
  • Logistic function f(x) = \frac{1}{1 + e^{-x}}
  • Hyperbolic tangent (shifted and scaled version of the logistic function, above) f(x) = \tanh x = \frac{e^x-e^{-x}}{e^x+e^{-x}}
  • Arctangent function f(x) = \arctan x
  • Gudermannian function f(x) = \operatorname{gd}(x) = \int_0^x \frac{dt}{\cosh t} = 2\arctan\left(\tanh\left(\frac{x}{2}\right)\right)
  • f(x) = \operatorname{erf}(x) = \frac{2}{\sqrt{\pi}} \int_0^x e^{-t^2} \, dt
  • Generalised logistic function f(x) = \left(1 + e^{-x} \right)^{-\alpha}, \quad \alpha > 0
  • function f(x) = \begin{cases}
{\displaystyle \left( \int_0^1 \left(1 - u^2\right)^N du \right)^{-1} \int_0^x \left( 1 - u^2 \right)^N \ du}, & |x| \le 1 \\ \\ \sgn(x) & |x| \ge 1 \\ \end{cases} \quad N \in \mathbb{Z} \ge 1
  • Some algebraic functions, for example f(x) = \frac{x}{\sqrt{1+x^2}}
  • and in a more general form f(x) = \frac{x}{\left(1 + |x|^{k}\right)^{1/k}}
  • Up to shifts and scaling, many sigmoids are special cases of f(x) = \varphi(\varphi(x, \beta), \alpha) , where \varphi(x, \lambda) = \begin{cases} (1 - \lambda x)^{1/\lambda} & \lambda \ne 0 \\e^{-x} & \lambda = 0 \\ \end{cases} is the inverse of the negative Box–Cox transformation, and \alpha < 1 and \beta < 1 are shape parameters.
  • Smooth transition function normalized to (−1,1):

\begin{align}f(x) &= \begin{cases} {\displaystyle \frac{2}{1+e^{-2m\frac{x}{1-x^2}}} - 1}, & |x| < 1 \\ \\ \sgn(x) & |x| \ge 1 \\ \end{cases} \\ &= \begin{cases} {\displaystyle \tanh\left(m\frac{x}{1-x^2}\right)}, & |x| < 1 \\ \\ \sgn(x) & |x| \ge 1 \\ \end{cases}\end{align} using the hyperbolic tangent mentioned above. Here, m is a free parameter encoding the slope at x=0, which must be greater than or equal to \sqrt{3} because any smaller value will result in a function with multiple inflection points, which is therefore not a true sigmoid. This function is unusual because it actually attains the limiting values of −1 and 1 within a finite range, meaning that its value is constant at −1 for all x \leq -1 and at 1 for all x \geq 1. Nonetheless, it is (infinitely differentiable, C^\infty) everywhere, including at x = \pm 1.


Applications
Many natural processes, such as those of complex system , exhibit a progression from small beginnings that accelerates and approaches a climax over time. When a specific mathematical model is lacking, a sigmoid function is often used.

The van Genuchten–Gupta model is based on an inverted S-curve and applied to the response of crop yield to .

Examples of the application of the logistic S-curve to the response of crop yield (wheat) to both the soil salinity and depth to in the soil are shown in .

In artificial neural networks, sometimes non-smooth functions are used instead for efficiency; these are known as .

In audio signal processing, sigmoid functions are used as transfer functions to emulate the sound of clipping.

In and , the Hill and Hill–Langmuir equations are sigmoid functions.

In computer graphics and real-time rendering, some of the sigmoid functions are used to blend colors or geometry between two values, smoothly and without visible seams or discontinuities.

between strong acids and strong bases have a sigmoid shape due to the logarithmic nature of the .

The logistic function can be calculated efficiently by utilizing type III Unums.

An hierarchy of sigmoid growth models with increasing complexity (number of parameters) was built with the primary goal to re-analyze kinetic data, the so called N-t curves, from heterogeneous experiments, in . The hierarchy includes at present three models, with 1, 2 and 3 parameters, if not counting the maximal number of nuclei Nmax, respectively—a tanh2 based model called α21 originally devised to describe diffusion-limited crystal growth (not aggregation!) in 2D, the (JMAK) model, and the Richards model. It was shown that for the concrete purpose even the simplest model works and thus it was implied that the experiments revisited are an example of two-step nucleation with the first step being the growth of the metastable phase in which the nuclei of the stable phase form.


See also
  • HELIOS Hybrid Evaluation of Lifecycle and Impact of Outstanding Science


Further reading
  • (1997). 9780070428072, WCB McGraw–Hill.
    . (NB. In particular see "Chapter 4: Artificial Neural Networks" (in particular pp. 96–97) where Mitchell uses the word "logistic function" and the "sigmoid function" synonymously – this function he also calls the "squashing function" – and the sigmoid (aka logistic) function is used to compress the outputs of the "neurons" in multi-layer neural nets.)
  • (NB. Properties of the sigmoid, including how it can shift along axes and how its domain may be transformed.)


External links
Page 1 of 1
1
Page 1 of 1
1

Account

Social:
Pages:  ..   .. 
Items:  .. 

Navigation

General: Atom Feed Atom Feed  .. 
Help:  ..   .. 
Category:  ..   .. 
Media:  ..   .. 
Posts:  ..   ..   .. 

Statistics

Page:  .. 
Summary:  .. 
1 Tags
10/10 Page Rank
5 Page Refs
1s Time